
Distributed Neighborhood Attention S2#161

Open
azrael417 wants to merge 18 commits into main from tkurth/distributed-neighborhood-attention

Conversation

@azrael417
Collaborator

This PR adds distributed Neighborhood Attention S2 support and fixes some issues in the existing attention kernel.

  • The existing serial attention kernel preallocated an output tensor that was too large when attention-based downsampling was used. This is fixed here.
  • The existing serial attention kernel does not produce the correct v gradient when used for upsampling. We will fix that in a follow-up.
  • This PR adds distributed neighborhood attention along with new tests for the feature. The distributed kernel does not support up- or downsampling yet.
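To make the first bullet concrete, here is a minimal sketch of the shape issue. All names (`nlat_in`, `nlat_out`, etc.) are illustrative, not the actual torch-harmonics API: with attention-based downsampling the output lives on the coarser output grid, so the preallocated tensor must use the output grid's dimensions rather than the input's.

```python
# Hypothetical sketch of the shape fix described above; the function and
# parameter names are illustrative, not the torch-harmonics API.

def attention_output_shape(batch, channels, nlat_in, nlon_in, nlat_out, nlon_out):
    """Return the shape the attention output tensor should be preallocated with.

    For attention-based downsampling (nlat_out < nlat_in), the output lives
    on the output grid; allocating (nlat_in, nlon_in) instead over-allocates,
    which is the bug the first bullet fixes.
    """
    return (batch, channels, nlat_out, nlon_out)
```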

@azrael417 azrael417 requested review from bonevbs March 30, 2026 16:10
@azrael417 azrael417 self-assigned this Mar 30, 2026
@azrael417 azrael417 force-pushed the tkurth/distributed-neighborhood-attention branch 3 times, most recently from b75d634 to 4d434d6 Compare April 6, 2026 07:45
@bonevbs
Collaborator

bonevbs commented Apr 15, 2026

Please bump the version number to 0.9.1a and start the Changelog for v0.9.1.

@azrael417 azrael417 force-pushed the tkurth/distributed-neighborhood-attention branch from da78cae to 46fb343 Compare April 16, 2026 08:23
@azrael417 azrael417 force-pushed the tkurth/distributed-neighborhood-attention branch from 46fb343 to b9a386e Compare April 16, 2026 10:52
@azrael417 azrael417 marked this pull request as ready for review April 16, 2026 12:40
Collaborator

@bonevbs bonevbs left a comment


Please bump the version number to 0.9.1a1 and start the Changelog for v0.9.1

Also, some minor comments below.

Comment thread torch_harmonics/attention/csrc/attention_cuda_bwd.cu Outdated
Comment thread torch_harmonics/distributed/primitives.py Outdated
def distributed_transpose_polar(input_, dims_, shapes_):
    return _DistributeTransposePolar.apply(input_, dims_, shapes_)

@torch.compiler.disable()
Collaborator


why do we need those?

Collaborator Author


This is important so that the torch.compile graph will properly break here.

@bonevbs
Collaborator

bonevbs commented Apr 16, 2026

@rietmann-nv can you also have a look at the new CUDA kernels for distributed spherical attention?

@bonevbs bonevbs requested a review from apaaris April 16, 2026 12:59